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SUMMARY 


The FIFO, a first-in-first-out memory buffer, is an elastic digital memory. Its main application 
is in data buffering between devices operating at different rates. Data written into the top is moved 
autonomously down toward the bottom of the FIFO to the lowest unoccupied location. Data read 
from the bottom of the FIFO will cause data from the top to move autonomously down toward the 
bottom. The concept of the FIFO just described is a simple one, but its implementation is very 
complex. Currently, FIFO is available in MOS LSI asynchronous form with data rate in the | MHz 
region. The FIFO described in this paper yields a simple high-speed iterative implemeatation, 
either synchronous or asynchronous. Because of this si:aple iterative structure, the FIFO is easily 
expandable in both number of words and bits per word, and it is very attractive from the viewpoint 
of integrated-circuit production. For these reasons, this FIFO is called the quasi-perfect FIFO. For 
the synchronous FIFO, a model was built and successfully used in the Controller for the UNICON 
Laser Memory. For the asynchronous FIFO, a model! was built and also successfully used in a high- 
performance magnetic tape Coniroller. 


INTRODUCTION 


The FIFO, a first-in-first-out memory, is an elastic two-port digital data buffering device. The 
designation of a FIFO is usually in the notation of n-bit X m-location. From top to bottom, the 
FIFO is numbered as locations 0, 1, 2,..., m— 1 in that order. Data input to the FIFO can only 
be written into the top, and data output from the FIFO can only be read from the bottom. Both 
input and output can be accessed simultaneously. Data written into the top will automatically fall 
toward the bottom at some rate to the lowest unoccupied location. Data read from the bottom 
also will automatically cause data at the top to fall toward the bottom. Operational status indica- 
tors for the FIFO, such as top-empty, bottom-full, %-full, %-empty, etc., are usually provided. 


The application of a FIFO ranges from slow data buffers for communication systems, queuers 
for peripheral resources management, to real-time high-speed data buffering of digital mass memo- 
ries. At present, FIFOs are available as a MOS LSI chip with a maximum data transfer rate of about 
1 MHz. The implementation of such a FIFO is the use of a random-access memory (RAM) with 
complex control logic consisting of read and write pointers. With such ar implementation, the 
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FIFO is usually an asynchronous device; that is, the FIFO has no provision to be clocked by a 
system clock, whereas a system clock is usually used in the design of large digital systems. To the 
user, the implementation of the FIFO is transparent. Therefore, the user sees the FIFO as a two- 
port device. However, from the hardware viewpoint, the RAM-type implementation approach has 
three obvious difficulties. The fi * is that the speed is limited; second, the control is too compli- 
cated; and third, che device is t. .y a one-port device since a RAM is used. 


The imple mentation of the FIFC to be described in this paper will not use a RAM, but uses 
conventional registers. The control, i..stead of a complex sequential network using counters as 
pointers to control the read and write operations, is a simple iterative network. The network 
consists of identical sequential circuits (SC), one SC per each location in the FIFO. The SC consists 
of one flip-flop and a few gates. The state —< the flip-flop also indicates whether thet FIFO location 
has valid data or not. The implementatic n can be either synchronous or asynchronous. For the | 
synchronous FIFO, the input and outpt data transfer rate is equal to one-half the system clock 
frequency. For the asynchronous FIFO, this transfer rate is equal to one-half the sveed of the data 
registers used, For STTL registers, this peed is in the region of 20 ns. 


PRELIMINARY DISCUSSION 


The FIFO, in its most elementary form, can be described as a two-port network as shown in 
figure 1. The rules for data transfer into and out of the FIFO from and to an external device are 
as follows: 


Input: — Device inputs data into FIFO if and only if (iff) FIFO top is empty (FIFO top is 
not full). 


Output: Device takes data from FIFO iff FIFO bottom is full. 


The above rules for data transfer imply that some system of protocols with the FIFO is required, 
With respect to these data transfer protocols of the FIFO, the system designer must be concerned 
with data rate error (DRE) since the system cannot wait too long for the FIFO to become top-empty 
or bottom-full. A DRE can exist under two conditions: 


Input: — There exists a DRE. if top of FIFO is full when the external device must write a word 
of real-time data into the FIFO. 


Output; There exists a DRE if bottom of FIFO is empty when the external device demands 
a word (in a real-time sense) from the FIFO. 


In orde to prevent against possible DRE unreported, the FIFO statuses of top-empty and bottom- 
full are inipoitant to the system designer. If a DRE does occur as described above, such an error 
condition must be reported to the control computer. The statuses %4-full and ‘%4-empty are also useful 
to the system ¢cesiener if the FIFO is used to transfer data between 2 high-speed mass memory and 
a large central memory (CM) system (usually core or IC). Ins ch aCM system, usually there are 
many devices and processors connected onto it. There exists frequent memory access conflicts 
a betwee’: devices and processors. If the conflicts are high enough such that the FIFO becomes 
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“full (or %-empty dependent upon write or read mass memory), the system designer can use the 
“full status to obtain a higher priority access /.ito the CM system. In a sense, the FIFO provides 
the system designer with a look-ahead status report to prevent possible DRE. 


DE) AILED DISCUSSION 


The FIFO, in its most elementary detailed block diagram form, is shown in figure 2. The FIFO 
shown is an n-bit X-'-location FIFO. Each location consists of a sequential-circuit (SC) and a data 
register (REG). All four locations are identical, thus the FIFO is an iterative sequential network. 

At the bottom of the FIFO, an optional holding-register (HR) is added. This HR addition is mainly 
for interface control. The statuses %-full and %4-empty are formed by two combinational networks 
(CNET). The implementation of SC and the register at each location can be either clocked or 
unclocked. If clocked by a system clock, the FIFO is called a synchrnous FIFO, otherwise it is 
called asynchronous. For a 4-location FIFO, the locations are numbered sequentially as FIFOO, 
FIFO1, FIFO2, and FIFO3. Within the SC of each location, there is only one flip-flop (FF) to 
indicate the status of that location. The output of the FF isQ. A 4-stage FIFO would then have 
QO, O1, Q2, and Q3. A FIFO location full of valid data is indicated by Q = |. otherwise it is empty. 


The operating ruies of the FIFC can now be stated: 


Write: Write into FIFOO iff QO = |, meaning FIFOO is empty, or not full. 
Read: Read from FIFO3 iff Q3 = 1, meaning FIFO3 is full. 


Transfer: FIFOi> FIFOi + 1 iff Qi= 1 and Qi + 1 = 1, meaning FIFO/ is full and 
FIFOi + | is empty. 


Output: First read FIFO3 into HR. Next gate data out from HR when convenient. 
A read of FIFO3 into HR will cause a transfer ripple up from FIFO3 to FIFOO. 


Input: A successful write into FIFOO will cause a transfer ripple down from FIFO3 
to FIFO3, 


% Full: “Full = Q3 Q2 Ql 
| % Empty: %Empty = Q2 Ql Qo 


The write and read rules, as mentiones earlier, require some protocols with the FIFO. For 
example, consider a write mass memory (MM) operation where data flow is from CM ~ FIFO ~ MM. 
Because of the FIFO, the control logic that controls the data flow can be separated into two parts — 
the result is a simpler control for each part. One part is mainly concerned with data flow from CM 
into FIFO. The other part is concerned with data flow from FIFO into the mass memory. The 
control logic that brings data from CM ~ FIFO can make a protocol with top of FIFO. Therefore, 
DRE can never occur. The control logic that tak:s data from FIFO ~ MM cannot make use of 
protocol since MM is usually a real-time device. In this case, possible DRE must be appropriately 
indicated. 


The interstage data transfer within the FIFO is automatic. For example, FIFO! is automatically 
transferred to FIFO2 if FIFO1 is full and FIFO2 is empty. If, however, FIFOO is also full, then 
FIFOO > FIFO1 takes place immediately after FIFO! + FIFO2 is finished. 
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SYNCHRONOUS IMP. SMENTATION 


The synchronous implementa‘ion, as mentioned earlier, is to use a system clock to synchronize 
all logic elements that have memory within the FIFO. That is, all flip-flops and registers are clocked, 
Each location of the FIFO consists of an SC and a data register. The design of the register is a 
simple task, One simply chooses a synchronous register, such as the 74179 (cr 748179), and 
connects many of these registers in parallel until the desired number of data bits are obtained. The 
design of the SC can be obtained directly from the operating rules of the FIFO. The simplest design 
is shown in figure 3 where each SC consists of one JK flip-flop (JK-FF) and one AND gate. For 
SC 1, it consists of FF Fl and AND gate G2. The JK-FF can be any type, however, a 748112 was 
used successfully in the Unicon Controller. Note that the 748112 is a negative-edge triggered FF, 
Consider FIFO stage 2 in figure 3, the input equations for the FF and the register are: 


LOAD2=Q1Q2 —;__ strobe data from REG1 into REG2. 
J2=LOAD2 =; set input to JK FF. 
K2=LOAD3 ; reset input to JK FF, 
LOAD3=Q2Q3_ —;__ strobe data from REG2 into REG3. 


The above equations are the same as the transfer rule stated earlier. For exampie, LOAD? is the 
signal that strobes the data from REGI into REG2. The signal LOAD2 = | iff 01 = | and Q2= 1, 
meaning REG is full and REG2 is empty. Likewise, FF2 is set iff LOAD2 = ., and reset iff data 
transfer from REG? to REG3 is finished. 


In order to further explain the FIFO operation, timing diagrams for write and read FIFO are 
shown in figures 4 and 5, respectively, For the synchronous implementation, note that the maximum 
data transfer rate for writing and reading the FIFO is equal to one-half the syste clock frequency. 
However, internal data propagation from top to bottom of FIFO is equal tc siem clock 
frequency. 


ASYNCHRONOUS IMPLEMENTATION 


The asynchronous implementation of the FIFO is slightly more complicated than the synchro- 
nous implementation. The detailed logic for a 3-stage asynchronous FIFO is shown in figure 6. 
The SC consists of one D-FF (7474), two AND gates, and one one-shot (OS, type Fairchild 9602). 
It should be noted that the D-FF is connected as a T-FF. The purpose of the OS is to provide a 
delay for the data register (74174). This delay must be greater than the sum of the data register 
set-up time plus propagation delay time. For a 748174, this delay should be about 20 ns. Likewise, 
if a 748174 is used for the data register, a 74874 should be used for the D-FF. In this case, the 
speed of the FIFO is limited by the OS because most OSs have a minimum pulse width. Fora 
F9602, this minimum pulse width is about 70 ns. The timing diagram for this FIFO is shown 
in figure 7. 


APPLICATION IN UNICON CONTROLLER 


The UNICON is a digital laser memory manufactured by Precision Instruments, Inc. It is part 
of the mass memory system associated with the ILLIAC-1V computer, and it has an on-line storage 
capacity of about 10'? bits. The UNICON is connected to the CM through a Controller [1]. The 
purpose of the Controller is to control the data flow between CM and UNICON. For controlling 
this data flow, the Controller was designed around the concept of a FIFO, which is used as an 
elastic memory buffer betwee: CM and UNICON. 


The Controller was designed and fabricated in 1973 by the Institute for Advanced Computa- 
tion (IAC). Briefly, the Controller is a modern channel controller using the concept of swapping 
register sets and a minicomputer (a PDP-11) to control its internal memory management functions. 
For this reason, the minicomputer is called the UNICON Memory Management Processor (UMP). 
UMP obtains its control programs through its own virtual address hardware interface into the CM. 
By means of this virtual address space, UMP is capable of addressing up to 4096 pages (512W * 36B) 
in CM. For a write operation, data flows from CM to FIFO and then to UNICON, and vice versa 
for a read operation. As of this writing, error control in the data strea™ is in two parts. The 
controller performs only error detection by means of appending a 1 28-bit checksum at the end of 
a page of data, which is a single parity check symbol in GF(2'**). The UNICON performs both an 
error detection by means of the checksum and error correction by means of a shortened (80, 64) 
Fire Cude for correcting all single-burst errors with burst length b <6. The FIFO is included in the 
viror detection data stream. A future : dan for augmenting the error correction capability is to 
concatenate an outer code to the existing (80, 64) Fire Code [2]. 


As mentioned earlier, the design of the Controller for controlling data flow uses an FIFO in 
the data stream. A functional block diagram of this design is shown in figure 8. The size of the 
FIFO is chosen to be 16WX16B. The UNICON data rate is 3.2 ws per 16-bit word. The CM cycle 
time, including interface protocols, is 1.4 ws per 36-bit word. MDR is a 37-bit (36 data bits plus 
parity) data register interfaced with CM. It also performs the 36-bit to 16-bit word multiplexing. 
HR is a 16-bit register interfaced with UNICON. MISR is a status register containing a word-count 
and status indicators such as DRE, parity error, and checksum error. There are two separate 
control sections called CM-CONTROL and UNICON-CONTROL. These two control sections are 
totally isolated from each other with no controls in common. This is made possible by using the 
FIFO 4s a buffering device in the data stream. In this application, the UNICON can be considered 
as a 32-bit word memory device with a data transfer rate of 6.4 ws per word. For the chosen FIFO 
size of 16W X 16B, the FIFO offers an elastic buffering action of about 49 ys (8 X 6.4), which is 
35 CM cycle times if there exists no memozy access conflict in CM. So for all practical purposes, 
there should never be a DRE occurring during data transfer. 


For a write operation, the FIFO initially is filled with data header information. CM-CONTROL 
is started to transfer data from CM to MDR, and then to the FIFO by making a protocol with the 
top of FIFO to ensure that data is deposited into the FIFO iff top of FIFO is empty. This process 
is repeated until the word count in MISR equals 512. Concurrently, UNICON-CONTROL is 
started to pull data off from the bottom of FIFO into HR, and then from HR to UNICON on a 
real-time demand basis. If the bottom of the FIFO is empty during this pulling, the DRE status in 
MISR will be set. This process continues until the UNICON has pulled 1024 16-bit words off the 
bottom of the FIFO. At any instant of time, the data flow within the FIFO i quite an interesting 
phenomenon. There may be data coming into the top and leaving the botto.a of the FIFO 


concurrently since it is truly a two-port device. As data is deposited into the top, it moves auto- 
nomously down towards the bottom to the lowest unoccupied location at the clock rate. As deta 
is pulled off the bottom (location 15), this action causes the data in the next full location to move 
toward the bottom also at the clock rate. Thus, at any instant of time, it is possible that there may 
be more than one piece of data, starting at v*rious locations within the FIFO, moving down toward 
the bottom of the FIFO concurrently. 


For a read operation, the data flow is very similar to the write operation as just described. 
The differences are that data from the UNICON in real-time is deposited into the top of the FIFO 
through gate G1, and data to CM is taken off the bott »m of the FIFO, 


CONCLUSION 


This paper has presented a design of the FIFO. The design uses registers for implementation, 
which is a deviation from the conventional approach of using random-access memories. The imple- 
mentation can be either synchronous or asynchronous. The control logic for this FIFO design is a 
simple iterative network, using one flip-flop and one gate for each stage of the FIFO in the case ef 
the synchronous implementation, The speed of the FIFO is limited only by the speed of the 
registers used. It is called the quasi-perfect FIFO because it has the characteristics of being simple 
in design, iterative in structure, high-speed, easy to produce from the IC viewpoint, and expansion 
capability in both words and bits per word. For the synchronous design, a model of the FIFO was 
built and successfully used in a controller for the UNICON laser memory. For the asynchronous 
design, a model of the FIFO was built and successfully used in a high-performance magnetic-tape 
controller for the Potter Instruments Model AT1 802. 
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